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On a class of quantum channels which includes the depolarizing channel, the highest fidelity 
of quantum error-correcting codes of length n and rate R is proven to be lower bounded by 1 — 
exp[— nE(R) + o(n)] for some function E(R). The E(R) is positive below some threshold Ro, which 
implies Rq is a lower bound on the quantum capacity. 
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I. INTRODUCTION 

Quantum error-correcting codes (simply called codes in 
this paper) are deemed indispensable for quantum com- 
putation as schemes that protect quantum states from 
decoherence. An information theoretic problem relevant 
to such codes is one of determining the quantum capac- 
ity of a channel, which is far from settled Jl], ^, ^J. 
This paper treats a problem closely related to the quan- 
tum capacity. The corresponding problem in classical 
information theory is that of determining the the high- 
est error exponent, called the reliability function, of a 
channel H fjj, which is briefly reviewed here. A classi- 
cal memoryless channel over a finite alphabet A" is a set 
of conditional probabilities {W(v\u)} u ^, e x ■ A classical 
code is a pair (C, V) consisting of a codeword set C C X n 
and a decoding map V : X n — > C. The performance of a 
classical code is evaluated in terms of maximum decoding 
error probability 

P c (C,P)=max J2 Wn (y\ x )' 

where W n {y\ . . . y n \xx . . . x n ) = W(yi\xi) . . . W{y n \x n ). 
For fixed n and k, let P* k denote the minimum of 
P e (C, T>) over all possible choices of (C, T>) with log d / \C\ > 
k, where the base d! > 1 is arbitrarily fixed. Shannon's 
channel coding theorem states that if a rate R is less than 



the capacity C(W) of the channel W, then P* 



n.Rn 



0. 



A stronger result of large-deviation theoretic appearance 
has been known || ffl, |l p|: There exists a function 
E r (R, W), called the random coding exponent of W , such 
that 



liminf-ilog d , P* tRn >E T (R, W), 



Le -> K, Rn ^ex Pd , 



-nE T (R,W)}, and 



E Y (R,W)>0 if R<C(W). 



(1) 



(2) 
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The function E*{R, W) = liminf„ -± log rf , P* >fln , which 
is called the reliability function of W , shows a trade-off 
between the reliability and data transmission rate of the 
best codes on channel W. The E*(R, W) actually equals 
E r (R, W) for relatively large rates R, and complete de- 
termination of E* (R, W) is one of the central issues in 
classical information theory || [?], ]§[ |) . Note that Shan- 
non's coding theorem directly follows from (|l|) and (||). 

Motivated by this classical issue, this paper presents an 
exponential lower bound on the highest possible fidelity 
of a code used on a class of quantum channels, which 
includes the depolarizing channel often discussed in the 
literature @, |, 0. This work was inspired by the recent 
result of Matsumoto and Uyematsu who used an al- 
gebraic fact due to Calderbank et al |12| [Eq. ^ below] 
to deduce a lower bound on the quantum capacity. This 
work's approach resembles theirs in that both bounds are 
shown using random coding arguments f| (?], |l3| based 
on (^) , but differs from [jllj in that while jllj uses an ana- 
log of minimum Hamming distance decoding, this work 
employs an analog of minimum entropy decoding |l4|] to- 
gether with the method of types from classical informa- 
tion theory [Q, |lj, [l5| , which enables us to obtain the 
exponential bound analogous to ([!]) in a simple enumer- 
ative manner. 



II. EXPONENTIAL BOUND ON FIDELITY 

We follow the standard formalism of quantum informa- 
tion theory which assumes all possible quantum opera- 
tions and state changes, including the effects of quantum 
channels, are described in terms of completely positive 
(CP) linear maps [[j], |], In this paper, only trace- 
preserving completely positive (TPCP) linear maps are 
treated. Given a Hilbert space H of finite dimension, let 
L(H) denote the set of linear operators on H. In gen- 
eral, every CP linear map M. : L(H) — > L(H) has an 
operator-sum representation M(p) = MipM} for 

some Mi £ L(H), i e 1 |[ 0. When M is specified by 
a set of operators {Mij^gi, which is not unique, in this 
way, we write M. ~ {Mi}i e x- 

Hereafter, H denotes an arbitrarily fixed Hilbert space 



2 



whose dimension d is a prime number. A quantum chan- 
nel is a sequence of TPCP linear maps {A n ■ L(H®") — > 
L(H® n )}. We want a large subspace C = C n C H®" in 
which every state vector remains almost unchanged af- 
ter the effect of a channel followed by the action of some 
suitable recovery process. The recovery process is again 
described as a TPCP linear map TZ n : L(H® n ) -> L(H® n ). 
A pair (C n ,lZ n ) consisting of such a subspace C n and a 
TPCP linear map lZ n is called a code and itsperformance 
is evaluated in terms of minimum fidelity || |], [TtJ 



where 



F{C ni 7Z. n An) — 



min (ip\K n A n (\ip) (if>\)\ip) , 
|i/>)ec„ 



where lZ n A n denotes the composition of A n and lZ n . 
Throughout, bras (-| and kets •) are assumed normal- 
ized. A subspace C n alone is also called a code assuming 
implicitly some recovery operator. Let F* k (A n ) denote 
the supremum of F(C n ,lZ n A n ) such that there exists a 
code (C n ,lZ n ) with log d dimC rl > k. This paper gives an 
exponential lower bound on F* k (A n ), in the case where 
{^4 n } is a slight generalization of the depolarizing channel 
specified as follows. 

Fix an orthonormal basis {|0), . . . , \d — 1)} of H. Put 
X = {0, . . . ,d - l} 2 and N (id) = X l Z j for G X, 

where the unitary operators X, Z G L(H) are defined by 



X\j) = \(j-l)modd), Z\j)=<J\j) 



(3) 



with ui being a primitive d-th root of unity |18|, |9|. 
The {N u } u< zx is a basis of L(H) and a generalization of 
the Pauli operators (including the identity) in that when 
d = 2, the basis {7, X, XZ, Z} is the same as the set of 
Pauli operators up to a phase factor. For simplicity, we 
confine ourselves to treating analogs of what are called 
memoryless channels in classical information theory, i.e., 
those {A n } such that A n = A® n , n = 1,2, ... , for some 
A : L(H) -> L(H); such a channel {A® n } is referred to 
as the memoryless channel A. In addition, we treat only 
channels that can be written as A ~ {\/P(u)N u } u ex, 
where P is a probability distribution on X. This restric- 
tion is mainly due to that the codes to be proven to have 
the desired performance are symplectic ( stabilizer, or ad- 
ditive) codes @ [0| |0|, §1], |2|, which exploit some 
algebraic property of the basis {N u } u£ x, arid that anal- 
ysis of code performance naturally turns out to be easy 
for this class of channels. Analysis for a wider class of 
channels will be given in future papers. 

As is usual in information theory, the classical 
informational divergence or relative entropy is de- 
noted by D and entropy by H [7|, [l5|: for prob- 
ability distributions P and Q on a finite set X, 
D{P\\Q) = T, x eX P ^)log d [P{x)/Q{x)] and H(Q) = 
— J2 X £X Q( x ) l°Sd Qi x )- This paper's main result is 

Theorem 1 Let integers n, k and a real number R sat- 
isfy < k < Rn and < R < 1 (a typical choice is 
k = lRn\ for an arbitrarily fixed rate R). Then, for a 
memoryless channel A ~ {y/ P(u)N u } u( zx , we have 

F* k {A® n ) > 1 - (n + l)2(d 2 -l) d -nE(R,P) 



E{R,P) 



mm[L>(Q||P) 



|1 - H(Q) - R\ 



\x\ + = max{x,0} 7 and the minimization with respect to 
Q is taken over all probability distributions on X. 

Remarks: An immediate consequence of the theorem 
is that the quantum capacity jl], g, Q of A is lower 
bounded by 1 — H{P). To see this, observe that E(R, P) 
is positive for R < 1 — H(P) due to the basic inequality 
D(Q\\P) > where equality occurs if and only if Q = 
P |j. The bound 1 — H(P) appeared earlier in [tLCfl , 
Sec. 7.16.2. 

Another direct consequence of the theorem is 

liminf -1 \og d {l - Fl [Rn] (A® n )} > E(R, P), (4) 

which resembles (|^). In fact, we can see that E{R, P) is 
closely related to E r (R, W) in (0) as follows. A specific 
form of E r is E r (R, W) — max p E r (R,p, W), where 



E T (R,p, W) = mm[D{V\\W\p) 



\I{p,V)-R\ 



see Q for detail. For consistency, we assume all loga- 
rithms appearing in the definitions of the rate of a code j^] 
and functions D, I are to base d. The function E(R, P) 
coincides with E r (R + l,p, W) with p being the uniform 
probability distribution on X and W being the channel 
defined by W-^vlu) = P(v — u), u,v G X — Fd, where 
denotes the finite field consisting of d elements. Rewrit- 
ing E T (R, p, W) into the other well-known form (see , 
pp. 168, 192-193, and || ||), we have another form of E: 

E(R, P) = ma* -S(R - 1) - (5 + 1) log, £ P(u)^i . 

Furthermore, putting P$(u) — P(w) 7 + T / J2vex 7 > (w) 7 + T , 
u G X, and Rs = 1 — H(Ps), we obtain 

(-R + l-2log d Y, u P(u)$ H0<R<R u 
E(R,P)=Id(P s *\\P) ifi?!<i?<i? , 
[ ilR <R, 

where 5* is a S with Rg — R; see FIG. [I]. 



III. QUANTUM ERROR-CORRECTING CODES 

To prove the theorem, we use a lemma on codes for 
quantum channels. We can regard the index of -/V(ij) = 
X l Z\ (i,j) G X, as a pair of elements from the field 
F = = Z/o!Z. From these, we obtain a basis N„ = 
{N x | x G (F 2 )"} of L(H® n ), where N x =N Xl ®...®N Xn 
for x = (xi,...,x n ) G (F 2 ) n . We write Nj for {N x G 
N„ | x G J} where J C (F 2 )". The index of a basis 
element 

2\n 



((ui,«i), . . . , {u n ,v n )) G (F ) 
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0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 
R 



FIG. 1: The function E(R,P) for the depolarizing channel, 
where d = 2 and P((0, 0)) = 1 - 3e, P(u) = e for u / (0, 0), 
U & X = {0, l} 2 , with s = 0.0025. 



can be regarded as the plain 2n-dimensional vector 

X = (Ui,Ui, . . .,Un,V n ) G F 2n . 

We can equip the vector space F 2 ™ over F with a sym- 
plectic paring (bilinear form, or inner product) defined 
by 



£ 

i=l 



for the above x and y = (it^, v[, . . . , u' n , v' n ) G F 2n |2|, |I| 
Given a subspace L C F 2n , let 

i ± = {.xG F 2 ™ | VyeL, (z,y) sp = 0}. 



Lemma 1 J7j}/ Let a subspace L C F 2n satisfy L C L 1 - 
and dimL = n — k. Choose a set J C F 2 ™, noi necessarily 
linear, such that 



{y 



xeJ,ye J} c (L 1 - \ L) c 



where the superscript C denotes complement. Then, there 
exist d k -dimensional N j- correcting codes. 

The codes in the lemma have the form {-0 G H® n | VM G 
7V L , M^j = r(M)ip} with some scalars r(M), M G 7V L - 
A precise definition of iVj-correcting codes can be found 
in Sec. Ill of |l7j] and the above lemma has been verified 
with Theorem III. 2 therein. Most constructions of quan- 
tum error-correcting codes relies on this lemma, which is 
valid even if d is a prime other than two [^8[ [l9[ |l[ p2[ . 

Now, for a memoryless channel A ~ {^P(u)N u } u£ x, 
and an iVj-correcting code C C H®", write 

P(C) =su P F(C,7^„^ n ) 

where 7?.„ ranges over all TPCP linear maps on L(H® n ). 
Then, since a recovery operator TZ n can be constructed 
explicitly so as to correct all errors in Nj, as in the proof 
of Theorem III. 2 of l|r7|, we have 



l-F(C)<^P n (x), 



(5) 



x4J 



IV. PROOF OF THEOREM 1 

We employ the method of types |?], [l4| [ll| , on which 
a few basic facts to be used are collected here. For x = 
(xi, . . . , x n ) G <-f", define a probability distribution P x 
on X by 

Pa(w) = \{i | 1 < i < n, Xi = u}\/n, u G X, 

which is called the type (empirical distribution) of x. 
With X fixed, the set of all possible types of sequences 
from X n is denoted by Q n {X) or simply by Q n . For a 
type Q G Q n , Tq is denned as {x G X n \ P x = Q}. In 
what follows, we use 



\Qn\ < (n + 1) 



1*1 



and VQ G Qn, |T"| <d" ff W). 



(6) 



Note that if x G 

Ua eX P(a) nQ{a) =ex 
We apply Lemma I 
dimL = n — k. Then, dimL 
of the d"~ fc cosets of L 



X n has type Q, then P"(x) 
id {-n[#(Q) + L>(Q||P)]}. 



choosing J as follows. Assume 
n + k |2j| . From each 
in F 2n , select a vector that 



minimizes H(P X ), i.e., a vector x satisfying H(P X ) < 
H(P y ) for any y in the coset. This selection uses the idea 
of the minimum entropy decoder known in the classical 
information theory literature H]. Let Jq(L) denote the 
set of the d n ~ k selected vectors. If we take J in Lemma |l| 
as J(L) = {z + w | z G Jq(L),w G L), the condition in 
the lemma is clearly satisfied. Let 

A = {L C F 2n | L linear, L C L^, dimL = n - jfc}, 

and for each L G A, let C(L) be an Nj/n -correcting code 
existence of which is ensured by Lemma [l]. Put 



LGA 

We will show that P is bounded from below by 1 — (n + 
l)2(d 2 -l) rf -nS(R,P) j which establishes the theorem. Such 
a method for a proof is called random coding [Q O, [l3| . 

The {0, l}-valued indicator function 1[T] equals 1 if 
and only if the statement T is true and equals otherwise. 
From (|5|), we have 

1 1 L£Ax£J(L) 

4ee p»(*)i[*^j(l)] 



LeA^eF 2 " 



E pn (*) 



|B(x) 



(7) 



where we have written P n (x\ . . . x n ) for P(xi) . . . P(x Tl 



where we have put 

B(a:) = {LeA\xi J{L)}, x G F 2,i . 
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The fraction |B(x)|/|A| is trivially bounded as 

M<1 x e F 2n 
|A| " ' • 

We use the next inequality Jn], [l2| . Let 

A(x) = {L e A | x e L 1 - \L}. 

Then, |A(0)| = and 

1 < - — -, x 6 F , x ^ 0. 



(8) 



|A| " d"- fc 



(9) 



Since B(x) C{LeA\3ye F 2n ,H(P y ) < H(P x ),y-x e 
L^\L} from the design of J(L) specified above (cf. |p^|), 

|B(z)| < J2 \Hv-*)\ 

y£f^:H(P y )<H(P x ), y=£x 

< Yl \A\d- n+k , (10) 

y£F 2 ":H(P v )<H(P x ), y^x 

where we have used (0) for the latter inequality. Com- 
bining (||) and (|op, we can proceed as follows with 
the aid of the basic inequalities in (^) and the inequality 
min{a + 6, 1} < min{a, 1} + min{6, 1} for a, b > 0. 



i-F < pn ( x ) m[n \ E d- (n - k \ 1 

xGF 2 ™ I yeF 2 ": H{P y )<H(P x ), y=£x 



7'n 



-RY 



< e i^i n p ^ nQia) ™4 e J 

QeQn a£X L Q'eQ„:H{Q')<H{Q) 

< exp d [-nD(Q\\P)} ]T exp d [-n|l - R - H(Q')\+] 

QeSn Q'GQ„: H(Q')<H(Q) 

< ex Pd {-nD(Q\\P)}\Q n \ max exp d [-n|l - E - H(Q')\+] 
- E \Qn\exp d [-nD(Q\\P)-n\l-R-H(Q)\+] 

< {n + lf d2 -^ C ^ d [-nE(R,P% 



r 



which is the promised bound. 



V. CONCLUDING REMARK 



ments, and especially, pointing out an error in the earlier 
manuscript, M. Hayashi and K. Matsumoto for helpful 
comments, and H. Imai for support. 



This author conjectures that the bound in (||) is not 
tight in view of the existence of the Shor-Smolin codes [pi . 
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